06 Apr 2020
Image taken from: The Turing Way book
Image taken from: The Turing Way book
No, it is a building block that’s essential for data science!
gitOpen source tools are:
Image taken from: The Turing Way book
Tracking the three Ws: Who made Which change and Why?
| Tool | Cost | Where is master? | Advantage |
|---|---|---|---|
| Github | Free | Online | Huge user base |
| Gitlab | Free | Local or Online | Continuous integration |
| Bitbucket | $$$ | Cloud | Cloud security |
| Azure | $$$ | Cloud | Cloud security |
We want to look back and be able to repeat our work easily and quickly.
What are the benefits?
Image taken from: The Turing Way book
Before we start, here are a few things to consider:
Image taken from: The Turing Way book
The building blocks of a RAP:
… are useful in their own right, each will improve the auditability, speed and quality of your work.
RAP has been successfully rolled out in 10s of projects across the UK government.
It is now part of the best practice documentation.
Work continues across the government to roll out RAP to more projects.
govdown - R package to recreate our UK government website template.
drake - R package to streamline reproducible pipelines in R.
snapcraft - python package to solve those dependency issues!
We aim to raise the aid community’s capacity for using Data Science to meet the Sustainable Development Goals.
We’ll link up Data Scientist mentors from the campus with mentees. Together we’ll set plan out a project, describing:
As we plan out our project, we’ll be considering those sometimes difficult aspects of RAP:
The UK goverment RAP website.
A free RAP course to teach you all you need to know.
How the Data Science Campus sets its coding standards.
A new open-source book from the Alan Turing insititute setting out how to do reproducible data science.
The github page for this presentation and other materials is here.
Get involved in the UK government RAP champion network here.
Many of the beautiful images used in this presentation were taken from The Turing Way book.
Full citation:
The Turing Way Community, Becky Arnold, Louise Bowler, Sarah Gibson, Patricia Herterich, Rosie Higman, … Kirstie Whitaker. (2019, March 25). The Turing Way: A Handbook for Reproducible Data Science (Version v0.0.4). Zenodo. http://doi.org/10.5281/zenodo.3233986